Language Identification through Parallel Phone Recognition
Authors
Abstract
Language identification systems that employ acoustic likelihoods from language-dependent phoneme recognizers to perform language classification have been shown to yield high performance on clean speech. In this report, such a method was applied to language identification of telephone speech. Phoneme recognizers were developed for English, German, Japanese, Mandarin, and Spanish using hidden Markov models. Each recognizer processed the input speech and output a phoneme sequence in its own language along with a likelihood score. The language of the incoming speech was hypothesized to be the language of the model with the highest likelihood. The main differences between this system and those developed in the past are that this system processed telephone speech, could identify up to five languages, and used phonetic transcriptions to train the language-specific models. The five-language, forced-choice recognition rate on 45-s utterances was 71.9%. On 10-s utterances the recognition rate decreased to 70.3%. In addition, it was found that adding word-specific phonemes to the training set had a negligible effect on language identification results.
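The decision rule described in the abstract (run every language-specific recognizer on the utterance, then pick the language whose model reports the highest likelihood) can be sketched as follows. This is a minimal illustration, not the report's implementation: the function name `identify_language` and the example scores are hypothetical, and the scores are assumed to be log-likelihoods already produced by each recognizer.

```python
def identify_language(log_likelihoods):
    """Return the language whose recognizer gave the highest log-likelihood.

    `log_likelihoods` maps a language name to the score that language's
    phoneme recognizer assigned to the utterance (hypothetical interface).
    """
    if not log_likelihoods:
        raise ValueError("need at least one language score")
    # Forced choice: the argmax over the available language models.
    return max(log_likelihoods, key=log_likelihoods.get)


# Made-up scores for a single utterance, one per language model.
scores = {
    "English": -4210.7,
    "German": -4302.1,
    "Japanese": -4255.3,
    "Mandarin": -4391.8,
    "Spanish": -4268.0,
}

print(identify_language(scores))  # prints "English"
```

Because the choice is forced, the function always returns one of the supplied languages, even when all scores are low.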
Related resources
Automatic Language Identification of Telephone Speech
Lincoln Laboratory has investigated the development of a system that can automatically identify the language of a speech utterance. To perform the task of automatic language identification, we have experimented with four approaches: Gaussian mixture model classification; single-language phone recognition followed by language modeling (PRLM); parallel PRLM, which uses multiple single-language...
Comparison of four approaches to automatic language identification of telephone speech
We have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian mixture model (GMM) classification; single-language phone recognition followed by language-dependent, interpolated n-gram language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and languaged...
Language identification using parallel sub-word recognition - an ergodic HMM equivalence
Recently, we have proposed a parallel sub-word recognition (PSWR) system for language identification (LID) in a framework similar to the parallel phone recognition (PPR) approach in the literature, but without requiring phonetic labeling of the speech data in any of the languages in the LID task. In this paper, we show the theoretical equivalence of PSWR and ergodic-HMM (E-HMM) based LID. Here, ...
Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition
In phonotactic language recognition systems, the use of acoustic model adaptation prior to phone lattice decoding has been proposed to deal with the mismatch between training and test conditions. In this paper, a novel approach using diversified phonotactic features from parallel acoustic model adaptation is proposed. Specifically, the parallel model adaptation involves independent mean-only an...
Fusion of contrastive acoustic models for parallel phonotactic spoken language identification
This paper investigates combining contrastive acoustic models for parallel phonotactic language identification systems. PRLM, a typical phonotactic system, uses a phone recogniser to extract phonotactic information from the speech data. Combining multiple PRLM systems together forms a Parallel PRLM (PPRLM) system. A standard PPRLM system utilises multiple phone recognisers trained on different ...
Publication date: 1998